Journals
  Publication Years
  Keywords
Search within results Open Search
Please wait a minute...
For Selected: Toggle Thumbnails
Short utterance speaker recognition algorithm based on multi-featured i-vector
SUN Nian, ZHANG Yi, LIN Haibo, HUANG Chao
Journal of Computer Applications    2018, 38 (10): 2839-2843.   DOI: 10.11772/j.issn.1001-9081.2018030598
Abstract535)      PDF (731KB)(306)       Save
When the length of the test speech is sufficient, the information and discrimination of single feature is sufficient to complete the speaker recognition task. However, when the length of the test speech was very short, the performance of speaker recognition is decreased significantly due to the small data size and insufficient discrimination. Aiming at the problem of insufficient speaker information under the short speech condition, a short utterance speaker recognition algorithm based on multi-featured i-vector was proposed. Firstly, different acoustic feature vectors were extracted and combined into a high-dimensional feature vector. Then Principal Component Analysis (PCA) was used to remove the correlation of the feature vectors, so that the features were orthogonalized. Finally, the most discriminating features were picked out by Linear Discriminant Analysis (LDA), which led to reduce the spatial dimension. Therefore, this multi-featured system can achieve a better speaker recognition performance. With the TIMIT corpus under the same short speech (2 s) condition, the experimental results showed that the Equal Error Rate (EER) of the multi-featured system decreased respectively by 72.16%, 69.47% and 73.62% compared with the single-featured systems including Mel-Frequency Cepstrum Coefficient (MFCC), Linear Prediction Cepstrum Coefficient (LPCC) and Perceptual Log Area Ratio (PLAR) based on i-vector. For the different lengths of the short speech, the proposed algorithm provided rough 50% improvement on EER and Detection Cost Function (DCF) compared with the single-featured system based on i-vector. Experimental results fully indicate that the multi-featured system can make full use of the speaker's characteristic information in the short utterance speaker recognition, and improves the speaker recognition performance.
Reference | Related Articles | Metrics
Gesture segmentation and positioning based on improved depth information
LIN Haibo, WANG Shengbin, ZHANG Yi
Journal of Computer Applications    2017, 37 (1): 251-254.   DOI: 10.11772/j.issn.1001-9081.2017.01.0251
Abstract560)      PDF (753KB)(555)       Save
Aiming at the problem that segmented gesture by Kinect depth information usually contains wrist data, which easily causes subsequent false gesture recognition, a gesture segmentation and positioning algorithm based on improved depth information was proposed. Firstly, the gesture binary image was detected based on depth information threshold limit in experimental space. Secondly, according to characteristics of common gestures, accurate gesture was segmented by gesture endpoint detection and variable threshold algorithm. In order to obtain stable segmentation results, morphological processing of segmented gesture was conducted. Lastly, the gesture positioning algorithm was proposed based on the method of combining gesture gravity center coordinates and maximum inscribed circle center coordinates. The experimental results show that the proposed gesture segmentation method has better accuracy and stability than the existing algorithm. The combined gesture positioning is more stable than gesture gravity center positioning and skeletal data positioning of Kinect Software Development Kit (SDK) and it has no singular points.
Reference | Related Articles | Metrics